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1 3 JUNI 2002 
Modtaget 

FLUOROPHORE COMPLEMENTATION PRODUCTS 

Field of invention 

The present invention relates to various split fluorophore complementation products, 
especially ways to obtain intense systems with GFP 

5 Background of the invention 

It has been suggested to use the reassembly of certain enzyme fragments of the 
complete enzyme as a measure of protein-protein interactions Johnsson and Varshavsky 
(Johnsson, N , Varshavsky, A (1994) Proc Natl Acad Sci U S A 91, 10340-10344) 
disclose reassembly of Ubiquitm This reassembly is detected through the irreversible 
10 cleavage of the fusion by Ubiquitin protease and release of a reporter As opposed to the 
two-hybrid technique, this technique includes the possibility of monitonng a protein-protein 
interaction as a function of time, at the natural sites of this interaction in a living cell 

Similar systems are suggested for the reassembly of other proteins including p- 
galactosidase {Rossi. F , Charlton, C A , Blau, H M (1997) Proc Natl Acad Sci U S A 

15 94. 8405-8410), dihydrofolate reductase (DHFR, WO98/34120), and p-lactamase 

(Wehrman, T . Kleaveland, B , Her, J H , Balint, R F , Blau. H M (2002) Proc Natl Acad 
Sci U S A 99,3469-3474) The basic concept is that by splitting a functional protein in 
two fragments, the function is lost The two fragments are transformed or transfected into 
cells fused in frame to proteins X and Y. respectively Binding between proteins X and Y 

20 will bnng the two fragments close together, increasing the local concentration of the 
complementing fragments, induce folding of these fragments and produce a functional 
protein with an activity that is similar to that of the non-fragmented protein If the function 
is DHFR activity, the cells will survive only if proteins X and Y bind to each other 

Recently, it has been descnbed to use a somewhat similar system for the assisted 
>5 reassembly and folding of fragments of fluorescent proteins As the function is 

fluorescence, the cells will emit light upon excitation only if protein X and protein Y bind to 
each other thereby assisting complementation 
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Ghosh (I Ghosh, A D Hamilton, L Regan (2000) J Am Chem Soc 122, 5658-5659, 
WO01/87919) descnbes the use of a GFP vanant called sg100 (F64L, S65C. Q80R, 
Y1 51 L, I167T and K238N) This GFP has single fluorescence excitation and emission 
peaks at 475 nm and 505 nm, respectively (similar to sg25 descnbed by Palm (Palm, 
5 G J , Zdanov, A , Gaitanans, G A , Stauber, R , Pavlakis, G N , Wlodawer, A (1997) Nat 
Struct Biol 4, 361-365)) 

Functional GFP fragment complementation is accomplished by co-expressing two 
independent peptides composed of the first 157 N-terminal amino acfcls of this GFP 
(NGFP) and the remaining 81 C-terminal amino acids (starting form residue 158) of this 
10 GFP (CGFP) with each of the GFP peptide fragments being fused to interacting leucine 
zipper peptides that serve to associate the fragments 

Nagai (T Nagai, A Sawano, E S Park, A Miyawaki (2001) Proc Natl Acad USA 98, 
3197-3202) tests a yellow fluorescent GFP variant that has the following mutations S65G, 
V68L, Q69K, S72A, T203Y This variant was split between residues N144 and Y145 
15 within the open 1 29-145 loop region, and the peptides fused to M1 3 and calmodulin, 
respectively, for use in a Ca 2 * assay However, when the constructs were transfected 
individually into HeLa cells, the assay was not reliable 

Hu (Chang-Deng Hu, Yum Chinenov, and Tom Kerppola (2002) Mol Cell 9, 789-798) 
enables the use of the GFP variant called EYFP (S65G, S72A, T203Y) The two GFP 
20 fragments which are descnbed to work are the first 1 54 amino acids of GFP in one 
construct and amino acids 155 through 238 in the other construct 

However, there is still a need for alternative GFP's for use in this technology 
Summary of the invention 

The present application discloses that certain GFPs can be reassembled and form a 
25 functional fluorescent protein when expressed as two independent proteins For example, 
when EGFP is expressed in mammalian cells, choosing a split site located in a loop 
region between the residues that form the beta-sheet structures of the GFP beta-barrel 
results in intense fluorescence (Example 5 and Example 7) 
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Detailed disclosure 

The non-fluorescent fragments of fluorescent proteins that can be combined to form one 
functional fluorescent unit are usually produced by splitting the coding nucleotide 
sequence of one fluorescent protein at an appropnate site and expressing each 
5 nucleotide sequence fragment independently The fluorescent protein fragments may be 
expressed alone or in fusion with one or more protein fusion partners 

Thus, one aspect of the invention relates to two GFP fragments comprising an N-termmal 
fragment of GFP, comprising a continuous stretch of amino acids from amino acid number 
1 to amino acid number X of GFP, wherein the peptide bond between amino acid number 
10 X and amino acid number X+1 is within a loop of GFP, the two GFP fragments also 
comprise a C-terminal fragment of GFP, comprising a continuous stretch of ammo acids 
from amino acid number X+1 to amino acid number 238 of GFP 

Amino acid 1 is meant to indicate the first amino acid of GFP Ammo acid 238 is meant to 
indicate the last amino acid of the GFP 

15 All residues are numbered according to the numbering of wild type A victona GFP 
(GenBank accession no M62653) and said numbenng also applies to equivalent 
positions in homologous sequences exemplified by alignment of fluorescent protein 
sequences in Example 1 Thus, when working with truncated GFPs (compared to wild 
type GFP) or when working with GFPs with additional amino acids, the numbenng is 

20 relative to the alignment 

Green Fluorescent Protein (GFP) is a 238 ammo acid long protein derived from the 
jellyfish Aequorea Victoria (SEQ ID NO 1) However, fluorescent proteins have also been 
isolated from other members of the Coelenterata, such as the red fluorescent protein from 

25 Dtscosoma sp (Matz f MV etal 1999, Nature Biotechnology 17 969-973), GFP from 
Rentlla reniformis, GFP from Renilla Muellen or fluorescent proteins from other animals, 
fungi or plants The GFP exists in various modified forms including the blue fluorescent 
variant of GFP (BFP) disclosed by Heim et al (Heim, R etal, 1994. Proc Natl Acad Sa 
91 26, pp 12501-12504) which is a Y66H variant of wild type GFP, the yellow fluorescent 

30 vanant of GFP (YFP) with the S65G, S72A, and T203Y mutations ( WO98/06737), the cyan 
fluorescent vanant of GFP (CFP) with the Y66W colour mutation and optionally the F64L, 
S65T, N146I, M153T. V163A folding/solubility mutations (Heim, R , Tsien, R Y (1996) Curr 
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Biol 6, 178-182) The most widely used variant of GFP is EGFP with the F64L and S65T 
mutations (WO 97/1 1 094 and WO96/23810) and insertion of one valine residue after the 
first Met The F64L mutation is the amino acid in position 1 upstream from the chromophore 
GFP containing this folding mutation provides an increase in fluorescence intensity when the 
5 GFP is expressed in cells at a temperature above about 30°C 

In the nomenclature of GFP, an "E" is placed in front of the GFP (EGFP. EYFP, ECFP) to 
indicate that this particular GFP is encoded by a nucleic acid with codon usage optimised for 
mammalian cells Most of these proteins also have an extra valine residue inserted after the 
initial methionine residue, Met 1 This residue is not considered in the numbenng of the 
10 residues Thus, in a preferred embodiment, the GFP of the present invention is selected 
from the group consisting of EGFP, EYFP, ECFP, dsRed and Renilla GFP 

Throughout the examples of the present application, EGFP is used Thus, in a preferred 
embodiment of the invention, the GFP is EGFP 

15 In the present context, the numbering of wild-type GFP (SEQ ID NO 1) (Chalfie, M , Tu, 
Y , Euskirchen, G , Ward, W W , Prasher, D C (1994) Science 263, 802-805, this vanant 
of GFP has a histidine residue in position 231) is used Based on the crystal structure of 
GFP (Yang. F , Moss, L G . Phillips. G N (1996) Nat Biotech 14, 1246-1251) Figure 5. 
Table 1 and the data presented in the examples, it is evident that a split in almost any 

20 loop will be re-assembled following appropriate spatial approximation to the 

complementation fragments assisted by the interaction of the conjugated proteins For the 
purpose of this application the term "loop" shall be understood as a turn or element of 
irregular secondary structure 

Thus, in one aspect, the invention relates to two GFP fragments as described above, 
25 wherein X is 7, 8, 1 1 or 12, preferably X is 9 or 10 within the Thr9-Val1 1 loop, or 
wherein X is 21 , 22, 25 or 26, preferably X is 23 or 24 within the Asn23-His25 loop, or 
wherein X is 36, 37. 40 or 41 . preferably X is 38 or 39 within the Thr38-Gly40 loop, or 
wherein X is 46, 47, 56 or 57. preferably X is between 48 and 55 1 e X is 48. 49, 50. 51. 
52, 53, 54 or 55 within the Cys48-Pro56 loop, or 
30 wherein X is 70, 71 , 76 or 77. preferably X is between 72 and 75 1 e X is 72. 73. 74 or 75 
within the Ser72-Asp76 loop, or 

wherein X is 79, 80. 83 or 84, preferably X is 81 or 82 within the His81-Phe83 loop, or 
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wherein X is 86. 87, 90 or 91. preferably X is 88 or 89 within the Met88-Glu90 loop, or 

wherein X is 99. 100. 103 or 104, preferably X is 101 or 102 within the Lys101-Asp103 
loop, or 

wherein X is 112. 113, 118 or 119, preferably X is between 114 and 117 1 e X is 1 14, 115. 
5 1 1 6 or 1 1 7 within the Phe1 1 4-Thr1 1 8 loop, or 

wherein X is 126, 127, 145 or 146, preferably X is between 128 and 144 1 e X is 128, 129. 
130, 131, 132, 133, 134, 135, 136, 137, 138. 139. 140, 141. 142. 143. 144 within the 
lie 128-Tyr145 loop, or 

wherein X is 152, 153, 160 or 161, preferably X is between 154 and 159 1 e X is 154. 155. 
10 1 56. 1 57, 1 58 or 1 59 within the Ala1 54-Gly160 loop, or 

wherein X is 169, 170. 175, 176, preferably X is between 171 and 174 1 e X is 171, 172, 

173 or 174 within the Ile171-Ser175 loop, or 
wherein X is 186, 187. 197 or 198. preferably X is between 188 and 196 i e X is 188. 189. 
190, 191, 192, 193, 194, 195 or 196 within the Ile188-Asp197 loop, or 
,15 wherein X is 208, 209, 215 or 216, preferably X is between 210 and 214 1 e X is 210. 211. 
212. 213 or 214 within the Asp210-Art215 loop 

Table 1 GFP secondary structures. GFP wild type sequence amino acid numbenng a 
and P indicate g-helical and (3-sheet seco ndary structures, respectively 

Name Position 

Helix 1 Lys3 - Thr9 
Sheet 1 Val1 1 - Asn23 
Sheet 2 His25 - Thr38 
Sheet 3 Gly40 - Cys48 
Helix 2 Pro56 - Ser72 
Helix 3 Asp76 - His81 
Helix 4 Phe83 - Met88 
Sheet 4 Glu90-Lys101 
Sheet 5 Asp103 -Phe1 14 
Sheet 6 Thr118 - Ile128 
Sheet 7 Tyr145- Ala154 
Sheet 8 Gly160-lle171 
Sheet 9 Ser175- Ile188 
Sheet 10 Asp197 - Asp210 
Sheet 1 1 Arg21 5 - Gly228 



a1 

P1 
02 

P3 

o2 

o3 

a4 

84 

P5 

86 

87 

P8 

P9 

p10 

811 
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Based on the findings disclosed in the examples, it is concluded that appropriate splitting 
sites in GFP are located in the loop regions between the residues that form the beta-sheet 
structures of the GFP beta-barrel Accordingly, splits in GFP are preferably made in the 
Asn23-His25 loop, the Thr38-Gly40 loop, the Lys101-Asp102 loop, the Phe114-Thr118 
5 loop, the Ile128-Tyr145 loop, the Ala154-Gly160 loop, the Ile171-Ser175 loop, the Ile188- 
Asp197 loop or the Asp21Q-Arg215 loop (Table 1, Figure 5) 

The data in the present examples illustrates clearly that the Ala154-Gly160 loop is very 
well suited for GFP reassembly This is particularly the case when the GFP is divided 
between ammo acids Q157 and K158 (that is, when X is 157) Thus, a preferred 
10 embodiment of the invention relates to two GFP fragments, wherein X is 157 within the 
Ala154-Gly160 loop 

The data in the present examples also illustrate that the He171-Ser175 loop is very useful 
for GFP reassembly This is particularly the case, when the GFP is divided between 
amino acids E172 and D173 (that is, when X is 172) Thus, a preferred embodiment of the 
15 invention relates to two GFP fragments, wherein X is 172 within the Ile171-Ser175 loop 

In order to obtain reassembly of the two halves of GFP, it is essential to have the two 
halves of GFP fused to interaction partners that will bring said two halves of GFP so close 
together that the protein halves will fold and form functional GFP (see Hu 2002 supra) 

20 Thus, a preferred embodiment of the invention relates to a fusion protein comprising an N- 
terminal fragment of GFP as described above conjugated to a first protein of interest In a 
particular embodiment the N-termmal fragment of GFP is fused in frame to the first protein 
of interest In similar embodiments, the present invention relates to two GFP fragments as 
described above, wherein the C-terminal fragment of GFP is conjugated to a second 

25 protein of interest In a particular embodiment, the C-terminal fragment of GFP is fused in 
frame to the second protein of interest 

As will be evident to the skilled person, the protein of interest is fused to the GFP 
fragment in the N-termmal or in the C-terminal However, as illustrated in the examples, 
fusion of the first protein of interest to the N-terminal fragment of GFP shall preferably be 
30 to the C-terminal of the N-termmaf fragment of GFP Likewise, fusion of the second 
protein of interest to the C-terminal fragment of GFP shall preferably be to the IM-terminal 
of the C-terminal fragment of GFP 
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As will be evident from the present examples, and eg Hu (2002 supra), the protein of 
interest is a protein, a peptide or a non-proteinaceous partner 

In a typical embodiment of the invention, the fusion protein as descnbed above, wherein 
5 the fragment of GFP is fused in frame to a protein of interest, further comprises a linker 
sequence between either fragment of GFP and the corresponding protein of interest 

The linker must be chosen dependent on the protein of interest conjugated to the 
fragment of GFP Thus the linker must be flexible A long linker prevent stenc hindrance of 
the complementation due to the protein of interest However short linkers keeps the 
10 fragments of GFP closer to each other and gives better associations 

The present invention also relates to the N-terminal fragment of GFP as described above 
In a similar embodiment, the invention relates to the C-tenminal fragment of GFP as 
descnbed above 

15 

A preferred embodiment of the invention relates to a Nucleic acid encoding any of the 
fragments or fusions proteins described above In one embodiment, the nucleic acid 
construct encoding any of the proteins according to the invention described above is a 
DNA construct In another embodiment, the nucleic acid construct encoding any of the 
20 proteins according to the invention descnbed above is a RNA construct 

One aspect of the invention relates to a cell containing the two GFP fragments described 
above In similar embodiments, the invention relates to a cell containing the N-terminal 
fragment of GFP descnbed above In similar embodiments, the invention relates to a cell 
25 containing the C-termmal fragment of GFP descnbed above 

Numerous cell systems for transfectioh exist A few examples of mammalian cells isolated 
directly from tissues or organs taken from healthy or diseased animals (pnmary celts), or 
transformed mammalian cells capable of indefinite replication under cell culture conditions 
(cell lines) The term "mammalian cell" is intended to indicate any living cell of mammalian 
30 origin The cell may be an established cell line, many of which are available from The 
Amencan Type Culture Collection (ATCC, Virginia, USA) or similar Cell Culture 
Collections The cell may be a pnmaiy cell with a limited life span derived from a 
mammalian tissue, including tissues denved from a transgenic animal, or a newly 
established immortal cell line denved from a mammalian tissue including transgenic 
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tissues, or a hybrid cell or cell line derived by fusing different cell types of mammalian 
ongin e g hybndoma cell lines The cells may optionally express one or more non-native 
gene products, e g receptors, enzymes, enzyme substrates, prior to or in addition to the 
fluorescent probe Preferred cell lines include, but are not limited to. those of fibroblast 
5 ongin, e g BHK, CHO, BALB, NIH-3T3 or of endothelial ongin, e g HUVEC, BAE (bovine 
artery endothelial), CPAE (cow pulmonary artery endothelial), HLMVEC (human lung 
micro vascular endothelial cells), or of airway epithelial origin, e g BEAS-2B. or of 
pancreatic ongin. e g RIN. INS-1 , MIN6, bTC3, aTC6. bTC6, HIT, or of hematopoietic 
origin, e g primary isolated human monocytes, macrophages, neutrophils, basophils, 
10 eosinophils and lymphocyte populations, AML-14, AML-193, HL-60. RBL-1, U937. RAW. 
JAWS, or of adipocyte ongin, e g 3T3-L1 , human pre-adipocytes, or of neuroendocnne 
origin, e g AtT20, PC12, GH3, muscle ongin, e g SKMC, A10, C2C12, renal ongin, e g 
HEK 293, LLC-PK1, or of neuronal ongin, e g SK-N-DZ. SK-N-BE(2), HCN-1A. NT2/D1 

15 One aspect of the invention relates to a method for detecting the interaction between two 
proteins of interest comprising the steps of 

(a) providing at least one cell that contains two heterologous conjugates, 

the first heterologous conjugate comprising a first protein of interest conjugated to an 
N-terminal fragment of GFP as described above, 
20 the second heterologous conjugate comprising a second protein of interest conjugated 
to a C-terminal fragment of GFP as descnbed above, and 

(b) measuring the fluorescence from the at least one cell, 

fluorescent cells indicating interaction between the two proteins of interest 

In a similar embodiment, the invention relates to a method for monitoring the interaction 
25 between two proteins of interest compnsing the steps of 

(a) providing at least one cell containing at least one stretch of nucleic acid encoding two 
heterologous conjugates 

the first heterologous conjugate compnsing a first protein of interest conjugated to an 
N-temninal fragment of GFP as described above, 
30 the second heterologous conjugate compnsing a second protein of interest conjugated 
to a C-termmal fragment of GFP as descnbed above, 

(b) cultunng the at least one cell under conditions allowing expression, and 

(c) measunng the fluorescence from the at least one cell, 

fluorescent cells indicating interaction between the two proteins of interest 
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In one aspect of the methods, one of the proteins of interest is known, whereas the other 
protein of interest is an unknown protein By parallel transfection the cells with interaction 
partners to the know protein of interest will be fluorescent and thereby easily detectable 

In a preferred aspect of the methods, the at least one cell is a mammalian cell 

In another preferred aspect of the methods, the heterologous conjugates are fusion 
proteins 



invention will be illustrated more specifically in the following non-limiting examples 



1016OK2 




Page 10 of 36 



Examples 



Example 1' Alignment of fluorescent proteins 


GenBank entrv 


nu orescent protein 




Aequorea victoria green-fluorescent protein 




Renilla reniformis green fluorescent protein 




Renilla muellen green fluorescent protein 


MTU 100^4 


Aequorea macrodactyla isolate GFPxrn 


nroOHDoO 


Montastraea cavernosa green fluorescent protein 


AF401282 


Montastraea faveolata oreen fluorescent nrntom 

wi i iiuvi Cuoui il yji U I till | 


AY015995 


Ptilosarcus sp CSG-2001 green fluorescent protein 


AF322221 


Anemonia sulcata green fluorescent protein asFP499 


AF322222 


Anemonia sulcata nonfluorescent red protein asCP562 


AF246709 


Anemonia sulcata GFP-hke chromoprotein FP595 


AF168419 


DsRed Discosoma sp fluorescent protein FP583 


AF1 68420 


Discosoma stnata fluorescent protein FP483 


AF168421 


Anemonia majano fluorescent protein FP486 


AF1 68422 


Zoanthus sp fluorescent protein FP506 


AF1 68423 


Zoanthus sp fluorescent protein FP538 


AF1 68424 


Clavularia sp fluorescent protein FP484 
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AY015996 rmGFP --- " MSKGEELFTGVVPILVELDGDV 22 

AF372S2S «GFP MSKQII1KNTCLQEVMSYKVNI1EGIV 2S 
AF168419 dsRed — -I MD LAKLGLKEVMPTKINLrEGLV 22 

5 AF322222 asCPS62 — — MRSSKNVIKEFMRFKVRMEGTV 22 

AF168422 asCP562 - -MASFLKKTMPFKTTIEGTV 19 

AF246709 asFP595 " MAQSKHGLTKEMTMKYRMEGCV 22 

AF322221 asFP499 * MASFLKKTMPFKTTIEGTV 19 

AF384683 mcGFP I.:.."." M VPS I KETMRVQLSMEGSV 19 

10 AF401282 mfGFP --- MSVIKPIMEIKLRMQGW 18 

Sltltlt SEE MKCKFVFCLSFLVLAITNANlFliRNEADLEEKTLRIPK^'^ 

AY01599S spGFP Ii::"" MSCS KS VI KE EMLI DLHLEGTF 22 
AF168423 ZSFP538 - MNRNVLKNTGLKEIMSAKASVEGIV 25 
15 AF1S8421 amFP48S --- MAHS KHGLKEEMTMKYHMEGCV 22 
AY013824 amGFPxm — " MALSNKFIGDDMKMTYHMDGCV 22 
. amGFPxm - MSKGEELFTGIVPVL1 ELDGDV 22 

20 AY01599K ^« ^KFSVSGEGEGDATYGKL.-TLKFICTTG-KLPVPWPTLVTTFSYGVQCFSRYPDHMK 79 
a^Iof GFP *^ P ™ BGCGK6NI ^ N Q--LVQIRVTKGAPLPFAFDIVSPAFQYGNR^TFT1CYPNMS 83 

*l\tltl^ ^ SRED NGHEFEIEGEGEGRPY EGHN--TVKLKVTKGGPLPFAWDILSPQFQYGSKVYVKHPADIP 80 
aSCPS62 NGHYFKCTGKGEGNPFEGTQ--EMKIEVlEGGPIiPFAFHIIiSTSCMYGSICTFIIC^SGIP 17 

%lHlii aB l PS9S NG " yFK ^KGBGMPFBGTQ--EM K IEVlEGGPLPFAFHir,STSC^GSKTFIKYVSGrP 77 
AF384«« aSFM 4 ^ P KCTGKGEGKPYEGTQ--S^1TITEGGPLPFAFDILSHAFQYGIK^FaSS« 77 
TSltl mCGFP NGHKFVIKGEGEGKPFEGT0--TINLTVfCBGAPLPFAYDILTSAFOYGNRVFTlCYPm»TP -7fi 

mtllll « GFP NGH 1 EGDG KGKPFBGTQ - - S MDLTVKEGAPL P FA YD I L^VFDYG^VFAKYPQD I P 76 
30 ll^lllt Villi ^^^^^^EGKPYDGTH- -TIiNLEVKEGAPLPFSYDlLSNAf^YGNRALTtWPDDIA lie 
av^fa^ dsPP483 N< 3HYFEIKGKGKGQPNEGTN- -TVTIiEVTKGGPIjPFGWHILCPQFQYGWKAFVIOIPDNIH 11 
«i «a!« *™'*^ F ®' i '' E< * F ^®A'* , ^'GNQ - - liMQI RVTKGGPLPFAFDI VS I AFQYGNRTFTKY^DDIA « 

Weill, XB ll 5 ll NGHKPVITGEGIGYPp KGKQ--TINI^IEGGP^^ B0 
^if!" 1 amFP486 NCSHYFTVKGEGNGKPYEGTQTSTFKVTMANGGPIAFSFDILSTVFKYGNRCF^ B2 

35 A I°"! 2 ! ? mGFPXM hghctswgegegd ^<^--e*k"^ 79 
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P42212 avGFP QHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEV--KFEG DTLVNRIELKGIDPJfRnr' 174 

AY01S996 rmGPP "DYFIQSFPAGFMYERTLRYEDGGIiVEIRSDI- -NLIE DKPVYRVE^KGSNFMDG SJ 

5 t^o^ 19 dSRED " "DYKKLSFPEGFKWERVMNFEDGGWTVTQDS- -SLQD GCFiyKVKFMVTOFPSDG III 

AF168422 asCPS62 - -DYFKNSCPAGYTWDRSFLFEDGAVCICNADITVSVEEN- - -CMYHESKFYGVNFPADG III 
^46709 asFPS95 - -DYFKQSFPEGFTWERTTTYEPGGFLTAHQDT- -SLDG- - -DCLVYKVKIIjG^JNFPADG U0 
AF322221 asFP499 - -DFFKQSLPGGFSWERVSTYEDGGVLSATQET- -SLOG DCIICWKVMiniFPANG III 

mass "sol -n™*"*™™™^ s; 

£n G P " "^PKQTFPEGYSWERSMTYEDQGICVATNDI - -TLMKGVDDCFVYKIRFDGVNFPANG "2 

SSS! 5 s " 4 !i " dyfk Q sfpeg ^h™tfedkgivw^^ m 

AF168420 dsFP483 - -DYLKLSFPEGYTWERSMHFEDGGLCCITNDI- -SLTG- - -NCFYYDIKFTGLNFPPNG 133 

1* ^?« 995 spGFP " dyfv Q sfpagff * e *n^^ HI 

ZSFP53S " DYP ™ SCPAGY ™ GR SFLFEDGAVCI<^ lit 

tllAtll am Zti B6 -- dyfkqafpdgmsyer t f t^^ III 

A Y0 1 3 8 2 4 ^ amG FPXM MNDFFKSAMPEG Y IQERT I FFQDDGKYKTRGEV- - KFEG DTIiVNRI ELKGMDFKEDG 134 

20 aVGPP NIMH ^ EYN ™ SI ^ I ^KQKNGIKVNFKIRHNIEDGSVQIADHYQQM--TPIGDGP 192 
tlllo^l r GFP P ^~ G Q^^MQPSYESMYTN- -VTSVIGECI IAFKLQTGKHFTYHMRTV YKSKKPV 111 

AS2222I astllll p ™;?k™ g ^ as * er ^^ HI 

25 AF .o!!^f asCP562 PVM- KKMTONWE PSCEKI I PVPKQG I tKGDVSMYLLLWMGRLR^FDTV- - - YKAKS VP III 

asFP595 p ™- qn ^ grwepate1wev "Bgvlrg Q sl^ HI 

aSPP4 " P ™- G ^ TrawEp STETVIPR-- D GGUJI^TPALM^^ ^3 

TZIZllll "I*™ " QKKTLKWEPSTEKMYVR- - DGVLKGDVNMALLLEGGGHYRCDFRST YKAKKR- "2 

in ^ll 2 l 2 mfGFP PVM "QKKTLKWEPSTEKMYVR- -DGVI1KGDVNHM1LLEGGGHYRCDFKTT- - - YKAKKF- III 

^ ! S SPP484 PVM " QKKTLKWE P STEIMYVR — DGVLVGD I SHSLLLEGGGH YRCDFKS I YKAKKV - 

Affile dSFM83 PW -Q^ TTGWEp STER£YPR--DGVLIGDIHHAI.TVEGGGHYACDI^---y R AKS A 137 

dllll PVM-QKAI LGMEPSFEWYMN - - SGVLVGEVDI.VYKLESGNYYSCHMKTF YRSKGGV III 

illllll, 2SFP538 ~ KKMTTNWEASCEKIMPVPKQGI MCGDVSMYLLLKDGGRYRCQFDTV YKAKSVP 191 

<ts amPP486 pw - ak ^ g wdpsfekmtvc--dgilkgdvtaflmlqgggnyrcqfhk---yS^ III 

35 AY013824 amGFPXM NILGHIOJEYNFNSHNVYIMPDKANNGIjKVNFKIRHN^GGGVQLADHYQTO- -V "2 

lll^l 2 * avGFP VLLPDNHYL STQSALSKDPNEKRDHMVLI,EFVTAAGITHGMDELYK 238 

40 ^llllll ^! P f^ PI W^KT^-EDGG-FVEQHETAIAQMTSIGKPLGSLHEWV 238 

^ 3 Z! rrGFP ETMPIjYHFIQHRIjVKTNV-DTASGYWQHETAIAAHSTIKKIEGSLP- - - 233 

AF168419 dsRED VQLPGYYYVDSKLDXTSH-NEDYTIVEQYERTEGRHHI1FI1 225 

AF322222 3SCPS62 RKMG ASHRDTL Vll 

AF168422 asCP562 RKMPDWHFIQHKLTREDRSDAKNQKWHLTEHAIASGSAXjP -—- 231 

45 aS ll*ll ^ PGFHFED ^ IEI ^ E -VEKGKCYKQYEAAVGRYCDAAPSKLaHN--- 232 

8SFP499 VKLPELHFHHLRMEKLNI - SDDWKTVEQHES WAS YS - QVPS KLGIC - - - 228 

A £ 3 ! 4 f mcGFP V Q LPDWP ^"RI E ILSH-DNDYNTVKLSEDAEARYSM1,PSQAK-----. 225 

AF401282 mfGFP VQIiPDYHFVDHRIEILSH-DKDYNKVKLYEHAEA-HSGIjPRQAK 227 

illlllll " PP484 VKLPDYHFVDHRrEILNH-DKDYNKVTLYENAVARYSLLPSQA 266 

50 A vnf«« ill ^^^^VIWNN-DKSFMKVEEHEIAVARHHPFYEPKKDK---- 232 

OU AY015995 spGFP KEFPEYHFIHHRLEKTYV- EEGS - FVEQHETAI AQLTTIGKPIiGSLHEWV 23 fl 
AF168423 zsFPS38 SKMPEWHFIQHKLLREDRSDAKNQKKQLTElSlAFPSA^"^---^" 23? 

tlnittll t^lilt ^^^^fARTDLDKGGMS-VQLTEHAVAHZTSVVPF 229 

-- 238 



AY013824 amGFPXM VLIPIMHYLSTQTAISKDRNETRDHMVFLEFFSACGHTHGMDELYK- 



55 
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Example 2 Construction of GFP complementation fragment probes 
Different fragments of EGFP fused to anti-parallel leucine zippers (called NZ and CZ) that 
can bind to each other within prokaryotic and eukaryotic cells were used to evaluate the 
optimal site for splitting EGFP for use of such fragments in molecular complementation 
5 expenments, including bimolecular fluorescence complementation expenments NZ and 
CZ leucine zippers were prepared by annealing and ligating phosphorylated oligo 
nucleotides 2110-21 15 (for NZ zipper, see Table 2) or phosphorylated ofigo nucleotides 
21 16-2121 (for CZ zipper), into Ncol-BamHI cut pTrcHis-A vector (commercially available 
from Invitrogen) producing vector PS1515 (expression vector encoding NZ zipper) or 

10 PS1 516 (expression vector encoding CZ zipper) The oligos tigated in NZ and CZ 

annealing mixes 1 produced the coding sequences of the N-terminal parts of the NZ and 
CZ zippers The oligos ligated in NZ and CZ annealing mixes 2 produced the coding 
sequences of the middle parts of the NZ and CZ zippers and the oligos ligated in NZ and 
CZ annealing mixes 3 produced the coding sequences of the C-terminal parts of the NZ 

15 and CZ zippers 

Annealing pnmer pairs for NZ zipper 



NZ annealing mix 1 

Forward oligo 2 1 1 0 (1 pM) 5 pA 

Reverse oligo 21 1 1 (1 jiM) 5 M | 

50 mM Tris-HCI, 1 0 mM MgCI 2 , pH 8 0 2\x\ 

H 2 0 8m , 

NZ annealing mix 2 

Forward oligo 21 12 (1 pM) 5 M | 

Reverse oligo 21 1 3 (1 \iM) 5 ^ 

50 mM Tris-HCI, 10mMMgCI 2 , pH 8 0 2 pi 

H2O 8 M , 
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N2 annealing mix 3 

Forward ohgo 2114(1 pM) 5 pi 

Reverse oligo 21 1 5 (1 pM) 5 pi 

50 mM Tns-HCI, 10 mM MgC! 2( pH 8 0 2 pi 

H *° 8 p) 



Each of the annealing mixes was heated at 80°C for 2 minutes on a pre-heated Hybaid 
OmniGene PCR machine which was subsequently turned off and allowed to cool to room 
5 temperature (about 10 mm) The fragments were subsequently put on ice 



Annealmg primer pairs for CZ zipper 
CZ annealing mix 1 

Forward oligo 21 16 (1 pM) 5 pi 

Reverse oligo 21 1 7 (1 pM) 5 pi 

50 mM Tns-HCI, 10 mM MgCI 2l pH 8 0 2 pi 
H 2° 8 pi 



CZ annealing mix 2 

Forward oligo 21 18 (1 pM) 5 pi 

Reverse oligo 21 1 9 (1 pM) .5 pi 

50 mM Tns-HCI, 10 mM MgCI 2 , pH 8 0 2 pi 

H 2° 8 pi 



CZ annealing mix 3 

Forward oligo 2120 (1 pM) 5 pi 

Reverse oligo 2121 (1 pM) 5 pi 

f fc0 mM Tns-HCI, 10 mM MgCfe, pH 8 0 2 pi 

l H *° 8 pi 
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Each of the annealing mixes was heated at 80°C for 2 minutes on a pre-heated Hybaid 
OmniGene PCR machine which was subsequently turned off and allowed to cool to roo 
temperature (about 10 mm) The fragments were subsequently put on tee 

Restnctton digestion ofpTrcHis-A prokaryotic expression vector 

5 The pTrcHis-A prokaryotic expression vector cut with Ncol and BamHI restriction 
enzymes and gel punfied was used for cloning the prepared NZ and CZ leucine zipper 
coding sequences 

Restriction digestion of pTrcHis-A vector 

pTrcHis-A (1 ng/jil) 
Ncol (10 U/|il) 
Nhel (5 U/jil). optional 
BamHI (20 U/nl) 
100x BSA 

10x NEB (New England Biolabs, NEB) BamHI buffer 
H a O 

Calf intestinal phosphatase (optional, last 20 mm only) 



2 nl 
1 pi 

0 5 nl 

1 Ml 

0 4 pi 

3 |il 
23 |il 
0 5 ul 



10 The vector was digested for about 1 hour at 37°C and purified by agarose gel 
electrophoresis The desired vector fragment was recovered from the gel using the 
QIAquick Gel Extraction kit (spin columns) from Qiagen and recovered in 50 pJ of elution 
buffer Nhel, which cuts between Ncol and BamHI, was included to minimise the amounts 
of uncut and self-ligating vector 

1 5 Ligation and transformation of annealed NZ oligo pairs 

Each of the three NZ annealing mixtures 1-3 was diluted 50-fold (1 pi of mixture in 50 pi of 
H 2 Q) and mixed and ligated into the cut vector as follows (three hours at 20-24°C) 
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Ligation of NZ zipper fragments into pTrcHis-A vector 

Annealing mix 1 -j ^ 

Annealing mix 2 -j ^ 

Annealing mix 3 -j ^ 

10x T4 DNA ligase buffer (New England Biolabs) 1 M | 

T4 DNA hgase (400 U/pl. New England Biolabs) 0 5 gl 

pTrcHis-A (Ncol + BamHI cut) o 5 nl 

H *0 5 ^ 



Alternatively, the fragments in NZ annealing mixes 1,2, and 3 can be hgated in absence 
of vector and purified by agarose gel electrophoresis before being ligated into the Ncol- 
5 BamHI cut vector The annealed and ligated oligo nucleotides from annealing mixes 1-3 
had single stranded terminal overhangs that were compatible with the overhangs that 
were generated by Ncol and BamHI restnction digestion of pTrcHis-A After ligation of the 
fragment into cut pTrcHis-A. the Ncol and BamHI sites were regenerated 

Following ligation into the vector, 2 ul of the ligation mixture was transformed into 50 of 
10 One Shot TOP10 chemically competent E coll cells (Invitrogen) following the 
manufacturers protocol The ligation can be performed using different amounts or 
volumes fragments and buffers The inserted DNA sequence and the encoded NZ zipper 
peptide are as follows 

MAGGTGSGALKKELQANKKE 
1 5 CCATGGCCGGTGGTACCGGTTCCGGTGCCCTGAAGAAGGAGCTGCAGGCCAACAAGAAGGAG 



LAQLKWELQALKKELAQ*D 
CTGGCCCAGCTGAAGTGGGAGCTGCAGGCCCTGAAGAAGGAGCTGGCCCAGTAGGATCC 



20 The Gly-Gly-Thr-Gly-Ser-Gly amino acid sequence in the terminus is part of the linker 
sequence that was inserted between the NZ zipper peptide and the NEGFP fragment (N- 
terminal fragments of EGFP are called NEGFP) The zipper sequence in the NZ-NEGFP 
fusion protein is also Gly-Gly-Thr-Gly-Ser-Gly with the Gly-Gly-Thr-Gly coding sequence 
being repeated in the NEGFP reverse amplification pnmers 2129, 2130, and 2131 (Table 

25 3) Underlined are the unique Ncol (ccatgg), Agel (accggt) and BamHI (ggatcc) sites 
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used for cloning of the zipper peptide into pTrcH.s-A and the NZ-NEGFP fragments into 
the N2 zipper vector PS1515 (see below) The asterisk (*) shows a stop codon 

Ligation and transformation of annealed CZ oligo pairs 

Each of the three CZ annealing mixtures 4-6 was diluted 50-fold (1 *d of mixture in 50 ul of 
5 H 2 0) and mixed as follows 

Ligation of CZ zipper fragments into pTrcHis-A vector 

CZ annealing mix 1 1 , 

CZ annealing mix 2 1 

CZ annealing mix 3 1 ^ 

10x T4 DNA ligase buffer (New England Biolabs) 1 M | 

T4 DNA ligase (400 ll/pl, New England Biolabs) 0 5 pi 

pTrcHis-A (Ncol + BamHI cut) 0 5 pi 

H *° - 5|d 



Alternatively, the fragments in CZ annealing mixes 1, 2. and 3 can be ligated in absence 
of vector and purified by agarose gel electrophoresis before being ligated into the Ncol- 
10 BamHI cut vector The annealed and ligated oligo nucleotides from annealing mixes 1-3 
had single stranded terminal overhangs that were compatible with the overhangs that 
were generated by Ncol and BamHI restriction digestion of pTrcHis-A After ligation of the 
fragment into cut pTrcHis-A, the Ncol and BamHI sites were regenerated 

Following ligation into the vector, 2 M l of the ligation mixture were transformation into 50 uJ 
15 of One Shot TOP10 chemically competent E coli cells (Invitrogen) following the 
manufacturers protocol The ligation can be performed using different amounts or 
volumes fragments and buffers The inserted DNA sequence and the encoded CZ zipper 
peptide are as follows 



20 



MASEQLEKKLQALEKKLAQL 
CCATGGCCAGCGAGCAGCTGGAGAAGAAGCTGCAGGCCCTGGAGAAGAAGCTGGCCCAGCTG 



EWKNQALEKKLAQGGTG* 
GAGTGGAAGAACCAGGCCCTGGAGAAGAAGCTGGCCCAGGGCGGCACCGGTTAGGATCC 
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The Gly-GIy-Thr-Gly amino acid sequence in the terminus is part of the linker sequence 
that was inserted between the CZ zipper peptide and the CEGFP fragment (C-terminal 
fragments of EGFP are called CEGFP) The zipper sequence in the CZ-CEGFP fusion 
protein is also Gly-Gly-Thr-Gly with the Thr-Giy coding sequence being repeated in the 
5 CEGFP forward amplification primers 2133, 2134, and 2135 (Table 3) Underlined are the 
unique Ncol (ccatgg), Agel (accggt) and BamHI (ggatcc) sites used for cloning of the 
zipper peptide into pTrcHis-A and the CZ-CEGFP fragments into the CZ zipper vector 
PS1516 (see below) The asterisk (*) shows a stop codon 

Example 3 E co/i colony PCR screen, plasmid miniprep and DNA 
10 sequencing 

The transformed bacteria were plated on Luna Broth (LB) agar plates containing 100 
jiig/m! of carbenicillin as selection (present in used E coll media) To quickly identify 
transformants containing the desired NZ or CZ constructs, colony PCR screening was 
performed using oligos 21 10 (5' forward NZ oiigo) and 21 15 (3* reverse NZ oligo) or using 
1 5 oligos 2116 (5 1 forward CZ oligo) and 2121 (3* reverse CZ oiigo) 

Per sample (15 id reaction volume) 

10x Taq polymerase buffer (Perkin Elmer) 1 5 nl 



dNTP (5 mM nucleotide, each) 0 3 pi 

50mMMgCJ 2 0 6|J 

Dimethyl sulphoxide (DMSO) 0 3 jil 

Taq polymerase (Perkin Elmer) 0 2 

5' forward pnmer (10 \M) 0 5 pi 

3' reverse primer (10 \M) 0 5\x\ 

H 2 0 6 1 nl 

Transformant resuspended in H 2 Q 5 0 \i\ 



Cycling parameters (RoboCvcler Gradient 96. Stratagene) 
Initial denaturation at 94°C for 3 mm followed by 25 cycles of (all steps of 1 mm) 
20 Denaturation at 94°C f pnmer annealing at 53°C and pnmer extension at 72°C 
Finally, an additional extension step at 72°C was included (5 mm) 
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16 NZ transfonmants and 16 CZ transformants were screened PCR fragments having the 
expected product sizes of about 120 base pairs were amplified from 14 NZ clones and 15 
CZ clones, as determined by agarose gel electrophoresis analysis 

Three of the positive colonies were picked from each transformation (NZ and CZ) and 
5 used to inoculate 5 ml of liquid LB medium After cultunng at 37°C over night, plasmid 
DNA was punfied by mini preparations using the QIAprep kit from Qiagen 

Plasmids containing correct NZ (PS1515) or CZ (PS1516) fragment inserts were identified 
by DNA sequencing on an ABI PRISM model 377 DNA sequencer using forward 
sequencing primer 1282 



10 Example 4 Prokaryotic expression vectors encoding EGFP fragment/zipper 
fusion proteins 

The DNA sequences encoding the NZ and CZ zippers in the prokaryotic expression 
vectors PS1515 and PS1516, respectively, can be fused to DNA sequences encoding 
desired EGFP fragments (N-terminal fragments of EGFP are called NEGFP and C- 

15 terminal fragments of EGFP are called CEGFP) or other fragments using the unique Agel 
restriction sites appropriately located in linker sequences in either the 5' end (as in the NZ 
vector PS1515) or in the 3' end (as in the CZ vector PS1516) of the leucine zipper coding 
sequence in combination with either of the unique Ncol or BamHI sites used for cloning 
the zipper coding fragments (DNA and amino acid sequences are shown above) The 

20 general structures of the fusion protein coding sequences are shown in Figure 1 

For example, to prepare a prokaryotic expression vector encoding a fusion protein 
consisting of NZ zipper N-terminally fused to an NEGFP fragment, e g residues 1-172. 
this region of the EGFP coding sequence in the commercial expression vector pEGFP-C1 
(Clontech) was amplified by PCR using forward oligo 2128 (containing a unique Ncol site) 
25 and reverse oligo 2131 (containing a unique Agel site) in accordance with Table 3 
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Pgr QAmnlp /SO I il uaHinn irAlnmA\ 




10x Pfu polymerase buffer (Stratagene) 


5 0 til 


dNTP (5 mM nucleotide, each) 


10 nl 


Pfu Hot Start polymerase (Stratagene) 


1 Ojil 


5* forward pnmer {10 uM) 


1 0 Ml 


3' reverse pnmer (10 uM) 


1 Opl 


pEGFP-C1 vector (10 ng/^) 


2 0 nl 




39 0 nl 



Cycling parameters f Hvbaid QmniGene PCR marhmA) 
Initial denaturation at 94°C for 3 mm followed by 25 cycles of (all steps of 1 mm) 
5 Denaturation at 94°C, pnmer annealing at 53°C and pnmer extension at 72°C 
Finally, an additional extension step at 72°C was included (5 mm) 



The PCR fragment encoding the desired EGFP fragment, e g the above mentioned 
fragment composed of residues 1-1 72, with appropnately engineered terminal restnction 
1 0 sites contained in the primer sequences was then gel purified as described above cut with 
Ncol and Agel or Agel and BamHI and ligated into the constructed NZ or CZ prokaryotic 
leucine zipper expression vectors PS1515 or PS1516 cut with the same enzymes and gel 
purified 



15 



Restriction digestion of NEGFP and CEGFP PCR fragments 
EGFP fragment (gel purified) 26 \i\ 

Ncol ( 1 0 U/ M l) or Bam HI (20 U/nl) 0 5 nl 
Agel (10 U/fil) 1 o nl 

10x New England Biolabs buffer 2 3 
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Restriction digestion of NZ (PS1515) and CZ (PS1516) vectors 



Vector (1 ng/nl) 1 o nl 

Ncol (1 0 U/jil) or BamHI (20 U/jil) 0 33 nl 

Agel (10 U/jil) 0 66^1 

1 0x New England Biolabs buffer 2 1 



H a O 7 



All enzymes were from New England Biolabs The DNA preparations were digested for 1 
hour at 37°C and gel purified 

5 Ligation of EGFP fragments into cut PS1 515 or PS1516 vector 



Ligation proceeded for 30 mm at 22"C after which 2 ul of each ligation mixture were 
transformed into 50 fi\ of One Shot TOP10 chemically competent E coli cells (Invitrogen) 
The transformed cells were plated on LB plates containing carbenicillin and plasmids were 
10 prepared from two colonies from each transformation as described above 

Example 5: EGFP based bimolecular fluorescence complementation m E. 
coli 

Plasmids that expressed functional N2-NEGFP or CZ-CEGFP complementation 
constructs were identified by co-transforming 10 uJ of One Shot TOP10 chemically 
15 competent E coli cells (Invitrogen) with 1 \A of each of appropriately matched NZ-NEGFP 
or CZ-CEGFP plasmids (i e , plasmids that express EGFP fragments, said fragments are 
truncated after (NEGFP fragments) or before (CEGFP fragments) the same splitting site 
and plating the co-transformed cells on LB plates containing carbenicillin and 5 mM of 
isopropyl-IJ-thiogalactoside (IPTG) 



Cut and purified vector 

Cut and purified NEGFP or CEGFP fragment 

10x T4 DNA hgase buffer (New England Biolabs) 

T4 DNA hgase (400 U/pl, New England Biolabs) 

H 2 Q 



2 nl 
4 pi 

1 nl 

0 5 pi 

2 5 Hi 
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The transformed cells were grown over night at 37°C E coli colonies that were green 
fluorescent because of EGFP based bimolecular fluorescence complementation were 
visible on the agar plate without magnification about 1 0-20 hours after transfection (the 
fluorescence developed further dunng storage of the plates at 5°C for one or more days) 
5 when illuminated with a blue light source (Fiberoptic-Heim LQ2600) and viewed through 
yellow filter glasses 



Functional complementation was clearly visible in cells co-transformed with 
complementation constructs based on splits between either residues 157 and 158 or 
between residues 172 and 173 and the DNA sequences of expression vectors that 
10 produced functional NZ-NEGFP or CZ-CEGFP complementation fragments (named 
PS1594, PS1595, PS1596, PS1597, see Table 4) were venfied by DNA sequencing usir 
primer 1282 as previously described 

Surpnsingly. the E coli colonies of cells co-transformed with the vectors expressing the 
EGFP complementation fragments with split in the Ile171-Ser175 loop (namely between 
1 5 residues 1 72 and 1 73, vectors PS1 595 and PS1 597) were significantly more fluorescent 
than the colonies of cells that were co-transformed with vectors expressing EGFP 
complementation fragments that were split in the Ala154-Gly160 loop (namely between 
residues 157 and 158. vectors PS 1594 and PS 1596) 

Functional complementation was not clearly visible in cells co-transformed with 
20 complementation constructs based on a split between residues 144 and 145 DNA 

sequencing confirmed that expression vectors PS1614 and PS1615 encoded the correct 
NZ-NEGFP and CZ-CEGFP complementation fragments, respectively 



Example 6: Eukaryotic expression vectors encoding EGFP fragment/zipper 
fusion proteins 

25 Because of the low fluorescence signal produced by the complementation fragments 
based on the 144/145 split fragments, only the complementation fragments that were 
based on splits at residues 157/158 or 172/173 were transferred to a eukaryotic 
expression system to permit evaluation of fragment complementation in mammalian cells 

NZ-NEGFP fragments in PS1596 and PS1597. and CZ-CEGFP fragments in PS1594 and 
30 PS1595, are flanked by an Ncol site 5' to the start codons and a BamHI site 3' to the stop 



1016DK2 _ „ _ 

Page 23 of 36 



codons The fragments were transferred as blunt-ended Ncol/BamHI fragments into 
mammalian expression vectors cut with Eco47lll/BamHI To select for stable expression 
of both an NZ-NEGFP and a C2-CEGFP expressing plasmid, the expression vectors for 
N2-NEGFP fragments and CZ-CEGFP fragments contain selection markers for 
5 neomycm/geneticin/G4l8 and zeocm, respectively 

Plasmids PS1594, PS1595, PS1596. and PS1597 were cut with Ncol restriction enzyme, 
blunt-ended with Klenow fragment, gel punfied, cut with BamHI and gel purified as 
described below 



Restnction digestion of NZ-NEGFP a nd CZ-CEGFP prokarvotic expression vectors 

PS1594, PS1 595. PS1596, fir PS1597 (1 ng/jil) 1 }x\ 
Ncol (10 U/nl, from New England Biolabs) 1 pi 

10x buffer 4 (NEB) 3 ^ 

Hz ° 25 nl 

10 



The plasmids were digested for about 1 hour at 37°C 1 ul of 1 mM dNTP mix and 1 unit 
of Klenow fragment (New England Biolabs) were added and the reactions were incubated 
30 minutes at room temperature The linear plasmid fragments were purified by agarose 
gel electrophoresis and recovered from the gel using the QIAquick Gel Extraction kit (spin 
15 columns) from Qiagen and recovered in 50 uJ of elution buffer 5 ul BamHI buffer (New 
England Biolabs) and 10 units BamHI enzyme were added The plasmids were digested 
for about 1 hour at 37°C The desired plasmid fragments were punfied by agarose gel 
electrophoresis and recovered from the gel using the QIAquick Gel Extraction kit (spin 
columns) from Qiagen and recovered in 50 uJ of elution buffer 

20 To stably co-express NZ-NEGFP and CZ-CEGFP fragments in the same mammalian cell, 
mammalian expression vectors carrying different selection markers were required To 
obtain this, the kanamycin/neomycm selection marker on the expression vector pEGFP- 
C1 was replaced with a zeocm resistance marker resulting in the plasmid referred to as 
PS0609 
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Replacement of kanamycin/neomycin marker on pEGFP-C1 with zeocin marker 

pEGFP-CI was digested with Avrll, which excises the kanamycin/neomycin selection 
marker, and following gel purification, the vector fragment was hgated with an 
approximately 0 5 kbp Avrll fragment encoding zeocin resistance This fragment was 
5 isolated by PCR amplification of the zeocin selection marker on plasmid pZeoSV 
(Invitrogen) using primers 9655 and 9658 (see Table 2) Both primers contain Avrll 
cloning sites and flank the zeocin resistance gene on plasmid pZeoSV including its E coh 
promoter The top primer 9658 spans the Asel site at the beginning of zeocin, which can 
be used to determine the orientation of the Avrll insert relative to the SV40 promoter 
10 which drives resistance in mammalian cells The resulting plasmid is referred to as 
PS0609 

Plasmids pEGFP-C1 (Clontech) and its zeocin-resistant derivative PS0609 were cut with 
Eco47lll restriction enzyme, gel purified, cut with BamHI and gel purified as described 
below These steps excise EGFP and leave the rest of the vectors intact 

15 Restriction digestion of eukarvotic expression vectors 

pEGFP-CI or PS0609 DNIA (1 jig/^l) 0 5 \i\ 
Eco47IM (10 U/pl, from Promega) 1 m' 
10x buffer D (Promega) 3 \i\ 

H 2 0 25 5 ix\ 

The plasmids were digested for about 1 hour at 37°C The linear plasmid fragments were 
punfied by agarose gel electrophoresis and recovered from the gel using the QIAquick 
Gel Extraction kit (spin columns) from Qiagen and recovered in 50 jil of elution buffer 5 \x\ 
20 BamHI buffer (New England Biolabs) and 10 units BamHI enzyme were added The 
plasmids were digested for about 1 hour at 37°C The desired vector fragments were 
purified by agarose gel electrophoresis and recovered from the gel using the QIAquick 
Gel Extraction kit (spin columns) from Qiagen and recovered in 50 \i\ of elution buffer 
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Ligat ion of NZ-NEGFP fragments into DEGFP-C1 and CZ-CEGFP fragments into PSOfifiQ 
Cut and purified vector fragment 1 ^ 

Cut and punfted NZ-NEGFP or CZ-CEGFP fragment 3 ^| 
10x T4 DNA ligase buffer (New England Biolabs) 1 u | 
T4 DNA ligase (400 U/ul, New England Biolabs) 0 5 ul 



Ligation reactions were incubated at 16°C overnight 3 ul were transformed into One Shot 
TOP10 chemically competent E coll cells (Invitrogen) and transformants were selected on 
5 imMedia with kanamycm or imMedia with zeocm (both from Invitrogen) for pEGFP-C1 and 
PS0609 derivatives, respectively 

4 transformants from each transformation plate were picked in imMedia medium with 
appropriate selection (kanamycm or zeocm) and grown at 37 degrees C for 6 hours 
Plasmid DNA was isolated by the QIAprep spin column method (Qiagen) and analysed by 
1 0 restriction digests with Asel and Mlul The DNA sequences of the inserts were finally 
verified by sequencing as described above The resulting plasmids were named PS1557, 
PS1558, PS1559, and PS1560 (Table 4) 



Example 7: EGFP based bimolecular fluorescence complementation in 
mammalian cells 

1 5 To establish cells lines that express EGFP fragment/zipper fusion proteins, CHO-hIR cells 
were transfected with plasmid pairs resulting in two cell lines 1) CHO-hIR 
PS1559+PS1557. and 2) CHO-hIR PS1560+PS1558 The CHO-hIR cell line consists of 
CHO-K1 (ATCC CCL-61) cells that have been stably transfected with the human insulin 
receptor (h IR, GenBankAcc#M 10051) as described in Hansen, B F , Danielsen, G M, 

20 Drejer, K , Sorensen, A R . Wiberg, F C , Klein, H H . Lundemose, A G (1996) 
Sustained signalling from the insulin receptor after stimulation with insulin analogues 
exhibiting increased mitogenic potency Biochem J Apr 1, 315 ( Pt 1) 271-279 The 
selection marker for the vector is methotrexate (MTX) The hIR expression is very stable 
in the CHO-hIR cells, without selection pressure, because of the insulin-sensitivity of the 

25 cell line and a very stable phenotype can be maintained without selection pressure 
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Stable cells were obtained by cell growth in selection medium containing Genettcm and 
Zeocin 

CHO-hIR cells were transfected using Fugene (Roche) according to the manufacturer's 
instructions The day after transfection, cells were examined for transient expression, split 
5 110 and exposed to selection medium (growth medium supplemented with 500 Mg/ml 
genetian (Invitrogen) and 1 mg/ml zeocin (Cayla) The ceJIs lines were stable after 2-3 
weeks of culture in selection medium 

The growth medium used was NUT MIX F-12 (Ham's) with GLUTAMAX-1 
(Gibco/lnvitrogen) supplemented with 10% fetal bovine serum (JRH Biosciences) and 1% 
10 Penicillin-Streptomycin (10,000 lU/ml, Gibco/lnvitrogen) The CHO-hIR cells were cultured 
in growth medium, and split 1 4 to 1 16 twice a week according to standard cell culture 
protocols The CHO-hIR PS1559+PS1557 and CHO-hIR PS1560+PS1558 were treated 
likewise, except that the growth medium was supplemented with 500 pg/ml genetian 
(Invitrogen) and 1 mg/ml zeocin (Cayla) at all times 

15 Images of three CHO cell lines separately transfected with pEGFP-C1 (expressing EGFP 
with a short C-termmal extension), PS1559 + PS1557 (expressing EGFP 
complementation fragments split at 157-158) and with PS 1560 + PS1558 (expressing 
EGFP complementation fragments split at 172-173) were collected 1 day, 2 days and 10 
days after transfection to assess the relative brightness of cells expressing the different 

20 complementation constructs Images were collected on a Nikon Diaphot 300 equipped for 
eptfluorescence work Light source for epifluorescence was a Nikon 1 00W Hg arc lamp, 
coupled to the microscope through a custom quartz fibre illuminator (TILL Photonics 
GmbH, Planegg, Germany) Excitation light passed through a 450-490 nm bandpass filter 
(Delta Light and Optics, Lyngby, Denmark) and was directed to the specimen via a 

25 Chroma 72100 505 nm cut-on dichroic mirror (Chroma Technology, Brattleboro, VT, 
USA) A x40 NA1 3 oil immersion lens was used for all images Emitted light passed 
through a 540-550 bandpass filter (Chroma) to a Hammamatsu Orca ER camera All 
images were collected with 50 millisecond exposure time, chosen to ensure non- 
saturation of images for even the brightest (EGFP-expressing) cells in each optical field 

30 (maximum pixel count <4095) Imaging software used to acquire images on this system 
was IPLab for Windows (Scanalytics, USA) 
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Presentation and analysis of images 

The microscope images were analysed using the ImageJ software package, the public 
domain image analysis software written by Wayne Rasband of the US National Institute of 
Health (http //rsb info nih gov/ij/) and the data analysis was performed in Microsoft Excel 
5 The images shown in Figure 2 are of fluorescent CHO-hIR cells co-transfected with 
different NZ-NEGFP and CZ-CEGFP expression vectors or transfected with pEGFP-C1 
The images are scaled individually to visualise the cells and the fluorescence distribution 
within them Because of this scaling, the relative fluorescence levels cannot be compared 
between the images When the same images are scaled identically they appear as in 
10 Figure 3 and it is apparent that the cells that are transfected with complementation 

constructs that are based on a split between residues 172 and 173 are significantly more 
fluorescent than the cells that are transfected with complementation constructs that are 
based on a split between residues 157 and 158 However, the cells transfected with the 
pEGFP-C1 construct show significantly stronger fluorescence on day 2 

15 The same images were analysed for background and maximum fluorescence intensities 
using the ImageJ software package (Figure 4) From the figure, it is clear that a split 
between residues 172 and 173, and probably anywhere else in this loop, is greatly 
superior to a split between residues 157 and 158 and probably also to splits anywhere 
else in this loop 



20 
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Figure legends 

Figure 1 

General structures of the fusion protein coding sequences 
Figure 2 

5 16 bit images of fluorescent CHO-hIR cells co-transfected with NZ-NEGFP and CZ- 
CEGFP expression vectors or transfected with pEGFP-C1 were taken and scaled 
individually to visualise the cells and the fluorescence distribution within them Because of 
the pixel intensity scaling, the relative fluorescence levels cannot be compared among the 
images The splitting sites are either at residues 157/158 (top row, plasmids PS1557 and 
10 PS1559) or at residues 172/173 (middle row, plasmids PS1558 and PS1560) The EGFP 
expression vector pEGFP-C1 was transfected into the cells in the bottom row The images 
were taken 1 day (left column), 2 days (middle column), or 10 days (right column) after 
transfection The images of the cells are representative of the cells that expressed 
functionally complementing fragments 

15 Figure 3 

The same 16 bit images of fluorescent CHO-hIR cells co-transfected with NZ-NEGFP and 
CZ-CEGFP expression vectors or transfected with pEGFP-C1 as shown in Figure 2 but 
the images are now shown with the same intensity scaling to allow companson of 
fluorescence intensities The cells that are transfected with complementation constructs 
20 that are based on a split between residues 172 and 173 (middle row) are clearly more 
fluorescent than the cells that are transfected with complementation constructs that are 
based on a split between residues 157 and 158 (top row) However, the cells transfected 
with the pEGFP-C1 construct (bottom row) show significantly stronger fluorescence at day 
2 

25 Figure 4 

The un-manipulated microscope images shown in Figure 3 were analysed using the 
ImageJ software package and data analysis was performed in Microsoft Excel For each 
16-bit monochrome IP Lab microscope image, pixel intensity data were produced in 
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ImageJ and exported to an Excel spread-sheet for data analysis The darkest and 
brightest 0 5% of the pixels were identified in each image and the average intensities of 
these two groups of pixels were calculated The average intensity of the 0 5% darkest 
pixels was defined as the back ground fluorescence intensity (shown as white bars in the 
5 histogram) and the intensity of the 0 5% brightest pixels was defined as the maximum 
intensity The difference in intensity between the maximum intensity and the background 
intensity was defined as the response (shown as cross hatched bars in the histogram) 
The sum of the background intensity and the response is equal to the maximum intensity 
From the figure, it is clear that EGFP based fluorescence complementation using a split 
10 between residues 172 and 173, and probably anywhere else in this loop, is greatly 
superior to EGFP based fluorescence complementation using a split between residues 
157 and 158 and probably also to splits anywhere else in this loop 

Figure 5 

Positions of appropriate fluorescent protein splitting sites are shown on ribbon and wire 
1 5 frame representations of GFP The two representations show the same sites from two 
sides (molecule rotated approximately 180 degrees around a vertical axis) 
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Tables 



Table 2 Ohgo nucleotides used in cloning Oligo nucleotides beginning with P* are 
phosphorylated at the 5' end to permit ligation 



Ohgo nucleotide 


Ohgo nucleotide sequence (5' end to 3' end) 


1282 


CAGACAATCTGTGTGGGCACTCGACCGG 


2110 


P'CATGGCCGGTGGTACCGGTTCCGGTGCCCTGAAGAAGGAGCTGCAGG 


2111 


P*AGCTCCTTCTTCAGGGCACCGGAACCGGTACCACCGGC 


2112 


P*CCAACAAGAAGGAGCTGGCCCAGCTGAAGTGGGAGCTGCAG 


2113 


P*CTCCCACTTCAGCTGGGCCAGCTCCTTCTTGTTGGCCTGC 


2114 


P'GCCCTGAAGAAGGAGCTGGCCCAGTAG 


2115 


P*GATCCTACTGGGCCAGCTCCTTCTTCAGGGCCTGCAG 


2116 


P*CATGGCCAGCGAGCAGCTGGAGAAGAAGCTGCAGGCCCTG 


2117 


P'CCTG CAGCTTCTTCTCCAGCTGCTCGCTGGC 


2118 


P*GAGAAGAAGCTGGCCCAGCTGGAGTGGAAGAACCAGGCCCTGGAG 


2119 


P*GGCCTGGTTCTTCCACTCCAGCTGGGCCAGCTTCTTCTCCAGGG 


2120 


P*AAGAAGCTGGCCCAGGGCGGCACCGGTTAG 


2121 


P*GATCCTAACCGGTGCCGCCCTGGGCCAGCTTCTTCTCCAG 


2128 


GGCGCCATGGTGAGCAAGGGCGAG 


2129 


GCCGGACCGGTACCACCGTTGTACTCCAGCTTGTG 


2130 


GCCGGACCGGTACCACCCTGCTTGTCGGCCATG 


2131 


GCCGGACCGGTACCACCCTCGATGTTGTGGCGGATC 


2132 


CCCCGGATCCTACTTGTACAGCTCGTCCATGC 


2133 


GGCGCCATGGGCACCGGTTACAACAGCCACAACGTC 


2134 


GGCGCCATGGGCACCGGTAAGAACGGCATCAAGGTG 


2135 


GGCGCCATGGGCACCGGTGACGGCAGCGTGCAGCTC 


9655 


TCCTAGGTCAGTCCTGCTCCTCGGCCACGAAGTGCAC 




TCCTAGGCTGCAGCACGTGTTGACAATTAATCATCGG 


9658 


CAGACAATCTGTGTGGGCACTCGACCGG 
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Table 3 Primer pairs used in EGFP fragment amplification 



Protein encoded by PCR fragment 


5' primer 


3' primer 


EGFP(1-144) 


2128 


2129 


EGFP(1-157) 


2128 


2130 


EGFP(1-172) 


2128 


2131 


EGFP( 145-238) 


2133 


2132 


EGFP(1 58-238) 


2134 


2132 


EGFP{ 173-238) 


2135 


2132 



Table 4 Cloning and expression vectors 



Vector 


Expressed protein 


Promotor 


Selection 
E coli/mamm 


PEGFP-C1 


EGFP 


CMV 


kan/neo 


PS0609 


EGFP 


CMV 


zeo/zeo 


pTrcHis-A 


no insert 


Trc 


amp/none 


PS1515 


NZ leucine zipper 


Trc 


amp/none 


PS1516 


CZ leucine zipper 


Trc 


amp/none 


PS1614 


NZ-NEGFP(1-144) 


Trc 


amp/none 


PS1596 . 


NZ-NEGFP(1-157) 


Trc 


amp/none 


PS 1597 


NZ-NEGFP(1-172) 


Trc 


amp/none 


PS1615 


CZ-CEGFP(1 45-238) 


Trc 


amp/none 


PS1594 


CZ-CEGFP< 158-238) 


Trc 


amp/none 


PS1595 


CZ-CEGFP(1 73-238) 


Trc 


amp/none 


PS 1559 


NZ-NEGFP(1-157) 


CMV 


kan/neo 


PS1560 


NZ-NEGFP(1-172) 


CMV 


kan/neo 


PS 1557 


CZ-CEGFP(1 58-238) 


CMV 


zeo/zeo 


PS 1558 


CZ-CEGFP(1 73-238) 


CMV 


zeo/zeo 



1016DK2 # ttfestyr^sen • Pa9e 32 ° f 36 

1 3 JUMI 2002 
Modtaget 

Sequences 

SEQ ID 1 Amino acid sequence of GFP 

MSKGEELFTGWPILVBLDGDVNGHKFSVSGEGEGDATYGKLTLKFICTT 
5 GKLPVPWPTLVTTFSYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFF 
KDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNV 
Y I MADKQKNGI KVNFKI RHN I EDGS VQLADH YQQNT P I GDGPVLLPDNH Y 
LSTQSALSKDPNEKRDHMVLLEFVTAAGITHGMDELYK 

10 

SEQ ID 2 Ammo acid sequence of GFP Y66W 

MS KGEE L FTG WP I LVELDGD VNGHKFS VSGEGEGD AT YGKLTLKF I CTT 
GKLPVPWPTLVTTFSWGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFF 
1 5 KDDGNYKT RAE VKFEGDTLVNR I E L KG I D FKEDGN I LGHKLE YNYNSHNV 
YIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHY 
LSTQSALSKDPNEKRDHMVLLEFVTAAGITHGMDELYK 

SEQ ID 3 Ammo acid sequence of GFP Y66H 

20 

MSKGEELFTGWPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTT 
GKLPVPWPTLVTTFSHGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFF 
KDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNV 
Y I MADKQKNGI KVNFKI RHNI EDGS VQLADHYQQNTP IGDGPVLL PDNHY 
25 LSTQSALSKDPNEKRDHMVLLEFVTAAGITHGMDELYK 
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Claims 

1 Two GFP fragments comprising 

(a) an N-terminal fragment of GFP, comprising a continuous stretch of amino acids from 
amino acid number 1 to amino acid number X of GFP, wherein the peptide bond between 

5 amino acid number X and amino acid number X+1 is within a loop of GFP and 

(b) a C-termmal fragment of GFP, comprising a continuous stretch of amino acids from 
amino acid number X+1 to amino acid number 238 of GFP 

2 Two GFP fragments according to the preceding claim, wherein GFP is selected from 
the group consisting of EGFP, EYFP, ECFP. dsRed and Renilla GFP 

10 3 Two GFP fragments according to any of the preceding claims, wherein the GFP is 
EGFP 

4 Two GFP fragments according to any of the preceding claims, wherein X is between 9 
and 10 within the Thr9-Val1 1 loop, or between 23 and 24 within the Asn23-His25 loop, or 
between 38 and 39 within the Thr38-Gly40 loop, or between 48 and 55 within the Cys48- 

15 Pro56 loop, or between 72 and 75 within the Ser72-Asp76 loop, or between 81 and 82 
within the His81-Phe83 loop, or between 88 and 89 within the Met88-Glu90 loop, between 
101 and 102 within the Lys101-Asp103 loop, or between 114 and 117 within the Phe114- 
Thr1 18 loop, or between 128 and 144 within the He 128-Tyr145 loop, or between 154 and 
159 within the Ala154-Gly160 loop, or between 171 and 174 within the Ite171-Ser175 

20 loop, or between 188 and 196 within the Ile188-Asp197 loop, or between 210 and 214 
within the Asp2l0-Art215 loop 

5 Two GFP fragments according to the preceding claim, wherein X is between 154 and 
159 within the Ala154-Gly160 loop 

6 Two GFP fragments according to the preceding claim, wherein X is 1 57 within the 
25 Ala154-Gly160 loop 

7 Two GFP fragments according to the preceding claim, wherein X is between 171 and 
174 within the Ile171-Ser175 loop 
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8 Two GFP fragments according to any of the preceding claims, wherein X is 172 within 
in He171-Ser175loop 

9 Two GFP fragments according to any of the preceding claims, wherein the N-terminal 
fragment of GFP is fused in frame with a first protein of interest 

5 1 0 Two GFP fragments according to any of the preceding claims, wherein the first protein 
of interest is fused to the N-terminal of the N-termmal fragment of GFP 

1 1 Two GFP fragments according to any of the preceding claims, wherein the first protein 
of interest is fused to the C-terminal of the N-terminal fragment of GFP 

12 Two GFP fragments according to any of the preceding claims, wherein the C-termmal 
10 fragment of GFP is fused in frame with a second protein of interest 

13 Two GFP fragments according to any of the preceding claims, wherein the second 
protein of interest is fused to the N-terminal of the C-terminal fragment of GFP 

14 Two GFP fragments according to any of the preceding claims, wherein the second 
protein of interest is fused to the C-terminal of the C-termmal fragment of GFP 

15 15 Two GFP fragments according to any of the preceding claims, wherein the N-terminal 
fragment of GFP fused in frame to a first protein of interest further compnses a linker 
sequence between the N-terminal fragment of GFP and the first protein of interest 

16 Two GFP fragments according to any of the preceding claims, wherein the C-terminal 
fragment of GFP fused in frame to a second protein of interest further comprises a linker 

20 sequence between the C-terminal fragment of GFP and the second protein of interest 

17 The N-terminal fragment of GFP according to any of the preceding claims 

18 The C-terminal fragment of GFP according to any of the preceding claims 

19 Nucleic acid encoding a fragment according to any of the preceding claims 
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20 A cell composing an N-terminal fragment of GFP according to any of the preceding 
claims 

21 A cell comprising a C-terminal fragment of GFP according to any of the preceding 
claims 

5 22 A cell composing the two GFP fragments according to any of the preceding claims 

23 A vector comprising the two GFP fragments according to any of the preceding claims 

24 A vector comprising the ^terminal fragment of GFP according to any of the preceding 
claims 

25 A vector comprising the Oterminal fragment of GFP according to any of the preceding 



26 A plasmid comprising the two GFP fragments according to any of the preceding 
claims 

27 A plasmid comprising the N-terminal fragment of GFP according to any of the 
preceding claims 

1 5 28 A plasmid comprising the C-termmal fragment of GFP according to any of the 
preceding claims 

29 A method for detecting the interaction between two proteins of interest comprising the 
steps of 

(a) providing at least one cell that contains two heterologous conjugates, 

20 the first heterologous conjugate compnsing a first protein of interest conjugated to an 
N-terrninal fragment of GFP according to any of the preceding claims, 
the second heterologous conjugate comprising a second protein of interest conjugated 
to a Oterminal fragment of GFP according to any of the preceding claims, and 

(b) measunng the fluorescence from the at least one cell, 

25 fluorescent cells indicating interaction between the two proteins of interest 



10 
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30 A method for monitoring the interaction between two proteins of interest comprising 
the steps of 

(a) providing at least one eel! containing at least one stretch of nucleic acid encoding for 
two heterologous conjugates, 

5 the first heterologous conjugate compnsing a first protein of interest conjugated to an 
N-teiminal fragment of GFP according to any of the preceding claims, 
the second heterologous conjugate comprising a second protein of interest conjugated 
to a C-terminal fragment of GFP according to any of the preceding claims, 

(b) cultunng the at least one cell under conditions allowing expression, and 
10 (c) measuring the fluorescence from the at least one cell, 

fluorescent cells indicating interaction between the two proteins of interest 

31 A method according to any of the preceding claims, wherein the at least one cell is a 
mammalian cell 
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